Why?


• National Records of Scotland (NRS) should plan to carry out
a census in 2021 which is predominantly online but which
should make the best use of admin data where possible.

• The scope of this census includes continuing to explore the
future greater use of admin data in collecting
socio-demographic statistics.

• There may be an opportunity to provide enhanced outputs
through the use of admin data, and this will be explored.

• Make recommendations for future censuses in Scotland.

Objectives for the Project


(1) Access to Data Sources

(1) What Administrative Data do we have access to?


• NHS Central Register
• Vital Events: Births, Deaths, Marriages & Civil Partnerships
• Health Activity Data (contains no medical information!)
• Electoral Registers
• Higher Education Student Data (HESA)
• Scottish Government School Pupil Census
• Scottish Funding Council Further Education Student Data
• Registers of Scotland (RoS) Residential Sales
• NRS Geography Data


These sources typically hold first name, last name, date of birth,
gender, address and postcode, with very limited additional information.

Why not use a single source?


• No single source has coverage of the entire population.

• Need to identify overlaps so we don't count people twice.

• Adjust for under-coverage.
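
The overlap problem above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that records are matched exactly on a normalised key of first name, last name, date of birth and postcode; the source names, records and the `match_key` and `combine_sources` helpers are all hypothetical, and real linkage would need to tolerate spelling and address differences.

```python
from typing import Dict, Iterable, Set, Tuple

# A record is (first name, last name, date of birth, postcode). Illustrative only.
Record = Tuple[str, str, str, str]


def match_key(record: Record) -> Record:
    """Normalise case and spacing so trivially different records compare equal."""
    first, last, dob, postcode = record
    return (
        first.strip().lower(),
        last.strip().lower(),
        dob,
        postcode.replace(" ", "").upper(),
    )


def combine_sources(sources: Dict[str, Iterable[Record]]) -> Dict[Record, Set[str]]:
    """Map each distinct person (by match key) to the set of sources they appear on."""
    seen: Dict[Record, Set[str]] = {}
    for source_name, records in sources.items():
        for record in records:
            seen.setdefault(match_key(record), set()).add(source_name)
    return seen


# Hypothetical example data: Ann Smith appears on both sources.
sources = {
    "health": [
        ("Ann", "Smith", "1980-01-02", "EH1 1AA"),
        ("Bob", "Jones", "1975-05-06", "G1 2BB"),
    ],
    "electoral": [
        ("ann", "SMITH", "1980-01-02", "eh11aa"),
        ("Cara", "Reid", "1990-09-10", "AB1 3CC"),
    ],
}

people = combine_sources(sources)
overlap = [key for key, found_on in people.items() if len(found_on) > 1]

print(len(people))   # distinct people across both sources
print(len(overlap))  # people found on more than one source
```

Counting distinct keys rather than rows avoids double counting, and the people seen on only one source give a first indication of where each source under-covers.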

(2) Data Quality


• Is the data we're using correct?

• Produce Quality Assessment of Admin Data reports for each source.
  - E.g. Report on internal migration for Scotland: quality assurance of
    administrative data used in population and migration statistics,
    January 2018

(2) Data Quality (continued)


Meeting with data suppliers:
• Background to the data
• What checks do they do?


Our own checks:
• Missing postcodes / do postcodes map to a council area?
• Check age-sex distribution
• Check length of first names and last names
• Compare with previous years
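
As a rough illustration of the checks listed above, the following Python sketch runs each one over a handful of made-up records. The field names, the `postcode_to_council` lookup and the thresholds are assumptions for the example, not the actual NRS checks.

```python
from collections import Counter

# Hypothetical admin-data extract; field names are assumptions.
records = [
    {"first_name": "Ann", "last_name": "Smith", "age": 34, "sex": "F", "postcode": "EH1 1AA"},
    {"first_name": "B", "last_name": "Jones", "age": 51, "sex": "M", "postcode": ""},
    {"first_name": "Cara", "last_name": "Reid", "age": 34, "sex": "F", "postcode": "ZZ9 9ZZ"},
]

# Illustrative postcode-to-council-area lookup (a real one would come from geography data).
postcode_to_council = {"EH1 1AA": "City of Edinburgh"}


def missing_postcodes(rows):
    """Records with no postcode at all."""
    return [r for r in rows if not r["postcode"].strip()]


def unmapped_postcodes(rows, lookup):
    """Records whose postcode is present but does not map to a council area."""
    return [r for r in rows if r["postcode"].strip() and r["postcode"] not in lookup]


def short_names(rows, min_length=2):
    """Records where either name is suspiciously short (e.g. an initial only)."""
    return [
        r for r in rows
        if len(r["first_name"].strip()) < min_length or len(r["last_name"].strip()) < min_length
    ]


def age_sex_counts(rows):
    """Counts by (age, sex), ready to compare against a reference distribution."""
    return Counter((r["age"], r["sex"]) for r in rows)


def relative_change(current_total, previous_total):
    """Year-on-year change in record counts, as a proportion of the previous year."""
    return (current_total - previous_total) / previous_total


print(len(missing_postcodes(records)))
print(len(unmapped_postcodes(records, postcode_to_council)))
print(len(short_names(records)))
print(age_sex_counts(records))
```

Each function returns the offending records or counts instead of just a pass/fail flag, so the results can be fed back to the data supplier.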

(2) Single Year of Age - Compared with 2011 Census

Population and Household Estimates


Population pyramids of Scotland, 1981-2039

1. Work out how to combine datasets so that any coverage error is
under-coverage rather than over-coverage (if possible).
2. Begin with population counts using 2011 data.


3. Use dual system estimation to adjust for under-coverage and 
estimate the complete population. 


4. Compare with the 2011 Census. 


Once working, “productionise” for datasets from 2016 
through to 2021. 


Compare with Census 2021 estimates. 
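
To make step 3 concrete, here is a minimal Python sketch of dual system estimation in its simplest (Lincoln-Petersen) form. The slides do not say which estimator is used, so this is an assumption; it also assumes the two sources are independent, contain no over-coverage, and are matched without error. The counts are invented.

```python
def dual_system_estimate(n_source_a: int, n_source_b: int, n_matched: int) -> float:
    """Lincoln-Petersen estimate of the total population.

    n_source_a: people counted on the first source
    n_source_b: people counted on the second source
    n_matched:  people found on both sources
    """
    if n_matched <= 0:
        raise ValueError("At least one matched record is needed to form an estimate.")
    return n_source_a * n_source_b / n_matched


# Invented counts for one small area.
estimate = dual_system_estimate(n_source_a=900, n_source_b=800, n_matched=750)
coverage_a = 900 / estimate  # estimated share of the population captured by source A

print(estimate)    # 960.0
print(coverage_a)  # 0.9375
```

The estimate exceeds either individual count because it allows for people missed by both sources, which is the under-coverage adjustment the step describes.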

Household Estimates

1. Trickier than population estimates. Most datasets relate to
individuals and not their relationships with others.


2. Relies on mapping each address to its UPRN (Unique Property
Reference Number).


3. Address information is available on the Electoral Register, Health
Activity and Vital Events datasets.


These will be occupied property estimates, rather than 
household estimates. 


Follows from the population estimates work. 
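
The idea of counting occupied properties via the UPRN can be sketched in Python as follows. The address normalisation, the `address_index` lookup and the UPRN values are all hypothetical; real address matching is considerably harder than the exact lookup shown here.

```python
def normalise(address: str) -> str:
    """Upper-case, drop commas and collapse whitespace so similar strings compare equal."""
    return " ".join(address.upper().replace(",", " ").split())


# Hypothetical address-to-UPRN index (UPRN values are made up).
address_index = {
    normalise("1 High Street, Edinburgh"): 1001,
    normalise("2 High Street, Edinburgh"): 1002,
}

# Hypothetical (person id, address) pairs drawn from admin sources.
people = [
    ("p1", "1 High Street, Edinburgh"),
    ("p2", "1 high street  Edinburgh"),
    ("p3", "2 High Street, Edinburgh"),
    ("p4", "Unknown Road"),
]


def occupied_properties(person_addresses, index):
    """Group people by UPRN; return the grouping and the ids that could not be placed."""
    occupants = {}
    unmatched = []
    for person_id, address in person_addresses:
        uprn = index.get(normalise(address))
        if uprn is None:
            unmatched.append(person_id)
        else:
            occupants.setdefault(uprn, []).append(person_id)
    return occupants, unmatched


occupants, unmatched = occupied_properties(people, address_index)
print(len(occupants))  # number of occupied properties
print(unmatched)       # people whose address did not map to a UPRN
```

Grouping by UPRN gives a count of properties with at least one resident, which is why the output is an occupied property estimate: nothing in the data says whether the people at one UPRN form a single household.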

Data Processing Tasks


• What tasks are worth taking forward into the 2019 Census
Rehearsal?


• Testing the following work packages to look at the size of the
problem, and time required to remedy:
1. Whole Census De-duplication
2. Creation of a Synthetic Census Coverage Survey (CCS) for the 2019 Rehearsal
3. Matching 2011 Census and 2011 CCS within 20 working days.
4. Resolving multiple responses within a household.

Data Processing Tasks


• Testing the following work packages to look at the size of the
problem, and time required to remedy (continued):
5. Item level quality assurance using dates of birth from admin data.
6. Test of using admin data to help with placement of skeleton records,
created from the estimation and adjustment process.
7. Test of Automatic Linkage methods for large datasets.
8. Removing false persons: deciding whether a response is from a real
person when limited information has been provided.


• Beneficial tasks may be taken forward to the October 2019
rehearsal.

Summary


• Gained access to a wide variety of datasets covering a
significant proportion of the population.

• Commencing quality assessments of these datasets.

• Testing work packages for how admin data can help with
Scotland's Census 2021.

Contact Details


• Dr. Andrew Waugh
  - Andrew.Waugh@nrscotland.gov.uk